NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties

https://doi.org/10.21437/Interspeech.2025-1013

Chen, William; Meng, Chutong; Shi, Jiatong; Bartelds, Martijn; Wang, Shih-Heng; Wang, Hsiu-Hsuan; Mosquera, Rafael; Hincapie, Sara; Jurafsky, Dan; Anastasopoulos, Antonis; et al (August 2025, ISCA)

Free, publicly-accessible full text available August 17, 2026
Advancing science- and evidence-based AI policy

https://doi.org/10.1126/science.adu8449

Bommasani, Rishi; Arora, Sanjeev; Chayes, Jennifer; Choi, Yejin; Cuéllar, Mariano-Florentino; Fei-Fei, Li; Ho, Daniel E; Jurafsky, Dan; Koyejo, Sanmi; Lakkaraju, Hima; et al (July 2025, Science)

Policy must be informed by, but also facilitate the generation of, scientific evidence
more » « less
Free, publicly-accessible full text available July 31, 2026
Using Large Language Models to Promote Health Equity

https://doi.org/10.1056/AIp2400889

Pierson, Emma; Shanmugam, Divya; Movva, Rajiv; Kleinberg, Jon; Agrawal, Monica; Dredze, Mark; Ferryman, Kadija; Gichoya, Judy Wawira; Jurafsky, Dan; Koh, Pang Wei; et al (January 2025, NEJM AI)

Free, publicly-accessible full text available January 23, 2026
ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets

https://doi.org/10.21437/Interspeech.2024-2248

Shi, Jiatong; Wang, Shih-Heng; Chen, William; Bartelds, Martijn; Bannihatti_Kumar, Vanya; Tian, Jinchuan; Chang, Xuankai; Jurafsky, Dan; Livescu, Karen; Lee, Hung-yi; et al (September 2024, ISCA)

Full Text Available
Assessing the potential of GPT-4 to perpetuate racial and gender biases in health care: a model evaluation study

https://doi.org/10.1016/S2589-7500(23)00225-X

Zack, Travis; Lehman, Eric; Suzgun, Mirac; Rodriguez, Jorge A; Celi, Leo Anthony; Gichoya, Judy; Jurafsky, Dan; Szolovits, Peter; Bates, David W; Abdulnour, Raja-Elie E; et al (January 2024, The Lancet Digital Health)

Full Text Available
Leveraging supplementary text data to kick-start automatic speech recognition system development with limited transcriptions

San, Nay; Bartelds, Martijn; Billings, Blaine; de Falco, Ella; Feriza, Hendi; Safri, Johan; Sahrozi, Wawan; Foley, Ben; McDonnell, Bradley; Jurafsky, Dan (January 2023, Proceedings of the Sixth Workshop on the Use of Computational Methods in the Study of Endangered Languages)

Full Text Available
Sensitivity as a Complexity Measure for Sequence Classification Tasks

https://doi.org/10.1162/tacl_a_00403

Hahn, Michael; Jurafsky, Dan; Futrell, Richard (January 2021, Transactions of the Association for Computational Linguistics)

Abstract We introduce a theoretical framework for understanding and predicting the complexity of sequence classification tasks, using a novel extension of the theory of Boolean function sensitivity. The sensitivity of a function, given a distribution over input sequences, quantifies the number of disjoint subsets of the input sequence that can each be individually changed to change the output. We argue that standard sequence classification methods are biased towards learning low-sensitivity functions, so that tasks requiring high sensitivity are more difficult. To that end, we show analytically that simple lexical classifiers can only express functions of bounded sensitivity, and we show empirically that low-sensitivity functions are easier to learn for LSTMs. We then estimate sensitivity on 15 NLP tasks, finding that sensitivity is higher on challenging tasks collected in GLUE than on simple text classification tasks, and that sensitivity predicts the performance both of simple lexical classifiers and of vanilla BiLSTMs without pretrained contextualized embeddings. Within a task, sensitivity predicts which inputs are hard for such simple models. Our results suggest that the success of massively pretrained contextual representations stems in part because they provide representations from which information can be extracted by low-sensitivity decoders.
more » « less
Full Text Available
Social Bias Frames: Reasoning about Social and Power Implications of Language

https://doi.org/10.18653/v1/2020.acl-main.486

Sap, Maarten; Gabriel, Saadia; Qin, Lianhui; Jurafsky, Dan; Smith, Noah A; Choi, Yejin (July 2020, Association for Computational Linguistics)

Full Text Available
The Diversity–Innovation Paradox in Science

https://doi.org/10.1073/pnas.1915378117

Hofstra, Bas; Kulkarni, Vivek V.; Munoz-Najar Galvez, Sebastian; He, Bryan; Jurafsky, Dan; McFarland, Daniel A. (April 2020, Proceedings of the National Academy of Sciences)

Prior work finds a diversity paradox: Diversity breeds innovation, yet underrepresented groups that diversify organizations have less successful careers within them. Does the diversity paradox hold for scientists as well? We study this by utilizing a near-complete population of ∼1.2 million US doctoral recipients from 1977 to 2015 and following their careers into publishing and faculty positions. We use text analysis and machine learning to answer a series of questions: How do we detect scientific innovations? Are underrepresented groups more likely to generate scientific innovations? And are the innovations of underrepresented groups adopted and rewarded? Our analyses show that underrepresented groups produce higher rates of scientific novelty. However, their novel contributions are devalued and discounted: For example, novel contributions by gender and racial minorities are taken up by other scholars at lower rates than novel contributions by gender and racial majorities, and equally impactful contributions of gender and racial minorities are less likely to result in successful scientific careers than for majority groups. These results suggest there may be unwarranted reproduction of stratification in academic careers that discounts diversity’s role in innovation and partly explains the underrepresentation of some groups in academia.
more » « less
Full Text Available
Framing and Agenda-setting in Russian News: a Computational Analysis of Intricate Political Strategies

Field, Anjalie; Kliger, Doron; Wintner, Shuly; Pan, Jennifer; Jurafsky, Dan; Tsvetkov, Yulia (October 2018, 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP))

Amidst growing concern over media manipulation, NLP attention has focused on overt strategies like censorship and “fake news”. Here, we draw on two concepts from the political science literature to explore subtler strategies for government media manipulation: agenda-setting (selecting what topics to cover) and framing (deciding how topics are covered). We analyze 13 years (100K articles) of the Russian newspaper Izvestia and identify a strategy of distraction: articles mention the U.S. more frequently in the month directly following an economic downturn in Russia. We introduce embedding-based methods for cross-lingually projecting English frames to Russian, and discover that these articles emphasize U.S. moral failings and threats to the U.S. Our work offers new ways to identify subtle media manipulation strategies at the intersection of agenda-setting and framing.
more » « less
Full Text Available

Search for: All records